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A Computer Science Version of Godel’s Theorem 



B. J. MacLennan 
Computer Science Department 
Naval Postgraduate School 
Monterey, CA 93940 

Abstract: 

We present a simplified proof of Coders theorem by appealing to well-knoTO program- 
ming concepts. The significance of Godel’s result to computer science, mathematics 
and logic is discussed. 

1. Intrcduction 

Most computer scientists have heard of GodeTs Incompleteness Theorem, and many 
have seen it "proved.” Yet, Gbdel’s theorem usually remains a mystery. The proof, as 
it’s generally presented, is almost incomprehensible. Students usually come av^ray ^vith 
a feeling that they’ve somehow been tricked. They v/ould probably ignore the theorem 
altogether, but they've been told that it’s very important, that it sets limits on logic 
and rational thought. It is therefore pairticularly unfortunate that both the theorem 
and the proof are presented so mysteriously. 

Part of the difficulty of the theorem is that it was ^.vritten in the early 1930’s, before 
the widespread use of general purpose computers. Gbdel’s Theorem is a theorem about 
computers —which wets formulated before there were many computers. 3y ’asing ideas 
familiar to any programmer. Gbdel’s Theorem can be made easily understandable. 
There is no excuse for doing mthput these concepts. 

Section 2 presents an informal proof of GbdeTs theorem based on ideas from pro- 
granaming. Section 3 generalizes the theorem to other questions of decidability, and 
Section 4 relates the results to symbolic logic. Finally. Section 5 discusses some evader 
Implications of the theorem. 



2. The lialtiiig Prcblem 



Before proving Godel’s theorem, I v/111 prove an equivalent result due to Turing: Tne 
Halting Problem, v/hich asks, "For any given program cind any given input, is there is a 
way of deterrnining whether that program wU halt when given that input?” For many 
programs this can be decided on a case by case basis. 'What is wanted is a generai pro- 
cedure that, given any description of a program P, and any input x, '^vlll tell us whether 
P halts when given x. In particular, we want a program that can decide the halting 
question. 

Before embarking on a search for a decision procedure for the halting problem, it is 
'^se to ask if such a procedure is possible. The classic approach tc determine whether 
something is possible or not assumes its existence and shows that this assumption 
leads to a contradiction. Self-contradictory things don’t e:dst. 

The proof I’ll discuss is related to the famous Liar’s Paradox of Epimenides: 

This statement is a lie. 

is the above statement true or false? If we assume it’s true, then we must conclude 
that it’s false (since it’s a lie); if we assume it’s false, then we must conclude it's true 
(since It’s not a lie). Put simply, this statement contradicts itself. Since this state- 
ment can be neither true nor false, we must conclude that it makes no factuei claim at 
ail. It was considerations of paradoxes like this that ^ed Godel and Tearing to their 
proofs. 

In the statement of the theorem and its proof we mil use a Pascal-Iike programming 
language called L. You will see, however tbmt the result does not depend on the 
language used, and remains true for any realistic programming language. 

Dejiniiion: A procsdui'e, called ‘Halts’, is called a decisian procedure for the halting 
problem in L if, given (l) any string p representing a program P in L, and (2) any 
string X, Haits(p,:r) - true if P{x) halts, and Haltsfj),^) = false if P[x) gees into an 
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infinite loop. 



Theorem: The decision procedure Halts for the halting problem in L is impossible. 
Proof: Assume that the procedure Halts exists. 

Halts(p,: 2 :) tells us whether the program P described by p mil halt v/hen applied to 
the string x. Since o is itself a string, and we want to know what mil happen when P is 
applied to any string, we can legitimately ask what would happen if P were applied to 
p, that is, il z-p. This is answered by the procedure call Halts(p,p). 

Now, consider the following definition of the procedure Q \n L\ 

procedure Qip) is 
definition of Halts 
begin 

if Halts(p,p) then 
label: goto label 
else return; 
end Q: 

The meaning of "definition of Halts” is that the defimtion of Halts is to be placed at 
that position in the definition of Q\ this permits Q to make use of Halts. The behavior 
of 0 is as follov/s: if Pip) halts, then Q goes into an infinite loop; othernse ? immedi- 
ately returns. 

The character string above, which defines the procedure s a program m A. 
this character string q : 

q = ' procedure 0(p) .... end Q;‘ 

Since g is a string representing a program in L, it is a legitimate input to the pro- 
cedure Q. Therefore we can consider the behavior of Q{q). Let’s trace the steps, ^(g) 
first asks if Haits(g,g). Since Halts is a decision procedure fcr the halting problem, it 
returns a Boolean (true or false) result indicating whether the procedure represented 
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by q v/ill halt when applied to q. But the procedure represented by q is 0 itself, so 
Halts(g.g) asks whether Q{q) halts. 

Now, either Q{q) halts or it doesn't. If it halts, then Haits(g,g) retm^ns true, and 
Q{q) goes into an infinite loop. This contradicts our assumption that Q{q) halts. 
Therefore, let’s suppose the opposite, that Q{q) doesn't halt. Then, Halts(g.gr) returns 
false and Q{q) immediately returns, i.e.. halts. This ageiin contradicts our assumption. 
Since there are only two possibilities {Q{q) halts or it doesn't) and both lead to a con- 
tradiction, v/e conclude that our assumption, that the procedure Halts is possible, v/as 
contradictory. Therefore there can be no such procedure as Halts. OED. 

IVe have skimmed over several issues in the proof of the hadting theorem. In view of 
its remarkable nature, we mil reconsider them. In particular, a proof by contradiction 
shows that at leojst one of the assumptions is contradictory; it doesn't tell us which one. 
In this case we made two major assumptions: that it is possible to mrite Halts in L and 
that it is possible to mite ^ in L. Maybe Halts can be v;rittenbut Q can't. 

I haven’t been too specific about what is and isn’t legal in the language L. 1 have 
said that L is typical in the sense that anything we prove about L mil hold for any real- 
istic language. In peirticuiar, obsen/e that to be able to define 0, we only need to be 
able to do the foilowng things: 

• Call another procedure (e.g. Halts). 

• Do a conditional test. 

» Go into an infinite loop. 

• Return immediately (i.e., not go into an infinite loop). 

These are things that can be done in any rsedistic programming language. Hence we 
must conclude that there is no decision procedure for the halting problem for any real- 
istic programming language. 
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Are there any other hidden assumptions in our proof? Yes, we have also assumed 
that our language can handle arbitrarily long character strings. Since programs can 
be arbitrarily long, this is necessary if we are to be able to write a Halts procedure 
that's applicable to all programs. Vfhat about languages that don't have character 
strings? The proof still goes through if there’s any data type (say, integers) that is 
equivalent to arbitrary bit strings. The reason is that a character string is just a string 
of bits. It should be noted, however, that the proof does not go through if any resource, 
either space or time, is limited to a particular amount. Since all real computers have 
such limitations, the theorem only applies to idealized computers vhth unbounded 
resources. 

3- ether Decision Problems 

Having found that there is no decision procedure for the halting problem, a natural 
question to ask is: What other questions can't be decided by a program? The answer 
is: almost any property you care to name. Suppose you want to write a decision pro- 
cedure Does-D(p,x) that, for any behavior D, tells you if p does D when applied to x . 
Then consider this procedure definition: 

procedure Q(p) is 
definition o/Does_D 
begin 

is Does_D(p,p) then 
don't do D 
else do D 
end 0; 

In the case of the Halting Problem D was 'halts', so don’t do i? was accomplished by 
'label: geto label’ and do Z^was accomplished by ‘return’ 

Call this string g. 3y the same reasoning as before you can see that Q{q) does D if 
and only if Q{q) doesn’t do D. Hence we have a contradiction. The only vmy the proof 
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can fail to go through is if the language is so weak that it is not possible to wite code 
that does D and doGsri't do D. For any realistic programming language, and property D 
of interest, it will be possible to %vrite Q in the language. 

There are some properties of programs that can be decided by programs. For 
example suppose we wanted to wite a function Halts_Quickly(p,x) which determines if 
p halts v/ithin 100 years when applied to x. Intuitively it seems like we ought to be able 
to write such a function: we just run P{x) until it halts or until 100 yeeu's are up, which- 
ever occurs first. Thus, HaltSL-Quickly(p,x) always returns an answer, although we may 
have to wait 100 years to get it! 

But, can't we write a paradoxical procedure Q that runs less than 100 years only if it 
doesn’t run less than 100 years? Let’s try and do this. If Halts_Quickly(p p ) is true, we 
will go into an infinite loop (looping for over 100 years would be adequate); othennse v/e 
^<vlll return immediately. The resulting program is: 

procedure Q(p) is 
... definition of Halts_Quickly ... 

begin 

if Halts-_Ouickly(p,p) then Zoop forever 
else return; 

endQ; 

New consider the application Q(q). Dees it halt quickly or not? 

Suppose Q{q) halts vhthin 100 years; then, Halts«Quickl 3 -^(g returns true, and 
Q{q) loops forever. This contradicts our assumption that Q{q) halts quickly. 

Now suppose Q{q) does not halt within 100 years; then HaltaJ5uickly(g ,g ) is false, 
and Q{q) returns immediately. Does this lead to a contradiction? No, since the run- 
ning time of Q{q) includes the time necessary to execute HaIt3J2uicldy(g ,g). This 
could cfell be more than 100 years (as it would be if we implemented it in the naive way 
described pre^/iously). Hence there is no contradiction. 
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This does not mean we coji write Halts_Quickly. only that it does not lead to a con- 
tradiction in the same way as Halts, On the other hand, our informal implementation 
discussion shows that Halts_Quickly could actually be programmed. 

Thus, the halting theorem does not say that it is impossible for a program to decide 
any property of all programs. In fact it is the unlimited properties (e.g., halts eventu- 
ally) which are undecidable; limited properties (e.g.. halts uAthin ICO years) are often 
decidable. 

4. Symbolic Logic 

You have probably heard that Gbdel’s theorem has something to do mth the limitations 
of logic; you may have even heard that it sets bounds on rational thought. Yet. I have 
not said anything about logic in the preceding sections. 1 have only discussed the limi- 
tations of what programs can tell us about other programs. To understand the connec- 
tion between programs and logic, it will be necessary to discuss some topics in the 
foundations of mathematics. 

A major concern of mathematicians in the 19th century was rigor, standards of 
proof by which mathematicians could ensure that their theorems were true. Mathema- 
ticians were anxious to secure the foundations of mathematics, to ensure that there 
were no contradictions implicit in mathematical theories. Mathematicians divided 
themselves into several, often antagoristic, schools depending on the approach to foun- 
dations that they took. One of the mcst important of these schools, founded by the 
famous mathematician ua^/id Hilbert (1862-1946), was called /crmuhsm. 

It had long been observed that deductive reasoning was often formal; that is. the 
correctness of a deduction could be decided on the basis of the form of the argument 
vTlthout reference to the meanings of the terms used in it. This is particularly true of 
mathematical proofs: they are often accomplished by successively transforming a for- 
mula into new formulas. These transformations are simple mechanical rearrange- 
ments of symbols. For example, we can prove r-hl=5 implies r=4 by these 
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transformations: 



x + l = 5 + = (4+l)-l 

=> x + (l-l) - 4+(l-l) 
x+0 = 4+0 
=> x=4 

The formedists believed that all mathematics could be reduced to foTTnai systems. A 
formed system has two parts: (1) a set of initioL strings, and (2) a finite set of comput- 
able transformation rules. By appl 3 dng the transformation rules to the initial strings, 
a formal system generates a set of derived strings. As you can see. a formal system is 
a primitive sort of program, in which (1) the transformation rules are the operations 
and statements of the program. (2) the initial strings are the input data, and (3) the 
derived strings are the intermediate and output data that result from applying the pro- 
gram to the input data. 

The formedists believed that they could make mathematics rigorous by reducing it 
to a formal system. By letting the initial strings of a formal system represent the 
mathematical axioms, and the transformation rules represent the deductive rules of 
inference, the derived strings would represent just those theorems that could be 
deduced from the axioms. Since the transformation rules were mechaniccd symbol 
manipulation operations, independent of the meamng of the symbols, it seemed that 
this approach would elinunate both non-rigorous ideas and the use of intuition from 
mathematical proofs. 

Another important goal of the formalist program was to establish the consistency of 

mathematics. Consistency means that it is not possible to prove both a proposition P' 

# 

and its negation ‘not P'. In other words, consistency means that there are no contrad- 
ictions inherent in the axioms. 

• The set of initial strings is requtrsd to be rucvrsive, i.e., a computer program can decide r/hether a given 
string is an initial string or not. 
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Mathematicians are also concerned with the compLeteness of their a^dcms: how 
many axioms must be included in order to prove all true theorems? In other v^ords, if 
a theorem is true, and can be expressed in the symbols of the formal system, it should 
be possible to derive it from the initial strings (axioms) by using the transformation 
rules (rules of inference). 

Formal systems gave the formalists hope that they would be able to prove the con- 
sistency and completeness of some form.alization of mathematics. This is because for- 
mal systems are themselves mathematical objects. Therefore, it is possible to prove 
things about formal systems by using mathematical techniques. In particular, they 
wished to show the consistency and completeness of mathematics by using rules of 
inference that were so simple that no one could object to them. They wanted to use 
flnitistic rules, that is, rules that could be executed on a computer. 

You may have already realized the dubiousness of this proposition. A formal system 
is essentially a prograimming language, and we know that it's impossible for a program 
to decide much of anything about a reasonably powerful programming language. 
Hence, if we conceive of mathematics as a formal system (i.e., programming language), 
then we can see that it will be impossible fcr mathematics to decide (prove) much of 
Interest about any reasonably powerful formal system, including itself. Hence, it’s not 
likely to be possible to use mathematics to prove the ccnsistency and completeness of 
mathematics. This is m fact the case. 

Let’s consider how we could prove this result more carefully. Suppose Tie 
mathematical language is powerful enough to eicpress the theorem p halts on c: , fcr 
any program p (in some prograrnxning language) and for any input x. Then we Imow 
that this formal mathematical system must be incomplete, because othenvlse v/e could 
either prove or disprove 'p halts on c:’ for each program p and input c:. This would 
solve the halting problem, which we just shov/ed is impossible. Hence, the central issue 
is whether our formal mathematical language is powerful enough to state the theorem 
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’_p halts on x’. 



Next J argue that the theorem ‘p halts on x' can be stated in any reasonably pov^-er- 
ful formed mathematical language. In particular, if we can talk about integers, 
sequences and sets of integers, and the arithmetic operations, then we can express 'p 
halts on X ’ . 

First, since 'p halts on x’ is a proposition about programs, it must be possible to 
talk about programs mathematically. A program in any programming language can 
always be written as a finite sequence of characters, and characters can be thought of 
as small integers (say, integers in the range 0 to 255 in the ASCII code), so any pro- 
gram can be represented by a finite sequence of integers, which is a mathematical 
object. If we assume that our programs take strings as inputs and return strings as 
outputs, then the inputs and outputs of programs can also be represented as 
sequences of integers. 

Yfe now know how to represent the program p and the input x; how do we express 'p 
halts on x'? Teat this means is that if p is applied to x, then some output y ^.vlll result. 
Now, if we had a f ’unction 'apply (p,x)' that returned the result of appMng p to x 
(assuming it halts) then we could express 'p halts onx’ mathematically: 

p halts onx if and only if there exists a y such that y - apply(p ,x) 

Notice that 'appiy‘ is just a mathematical function that takes two sequences of integers 
and retvnns a sequence of integers. 

7ne ‘apply function is essentially an interpreter for our programming language. 
Can one define an interpreter using just mathematics? LISP programmers ’vlll realize 
that the ansv/er is “yes," because programming in pure LISP is essentially program- 
ming in mathsmatics, and a LISP interpreter can be written in about twent^z-five lines 
of LISP. If you are not familiar ivith LISP you may talce a little more convincing. 

Think about the vmy a program executes: at each stage of its execution it takas the 
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values of a number of variables and computes new values for a (possibly overlapping) 
set of variables. This alteration in the variables is determined by the current instruc- 
tion being executed in the program. Further, each instruction designates another 
instruction ais the next one to execute. 

Now, the memory that contains all the variables in a program is just an array of bits, 
so the memory can be represented as a sequence of integers. Also, the instmction- 
pointer, which designates the next instruction in the program to execute, is just a 
sequence of bits, so it can also be made peirt of this sequence of integers. Notice that 
this sequence of integers, which we call the stats of the program, is all that changes 
from one step of the program to the next. 

In order to avoid a lot of very tedious details, I have to skip some of the finer points 
of this argument. Therefore, suppose that we have defined a mathematical function 
‘Next-State’ such Next_3tate(jD,s) is the state resulting from executing one step of the 
program p in state s. (The instruction to execute is determined by the instruction- 
pointer part of s.) It should be fairly clear that it is possible to define such a function, 
because most programming language instructions just change a few variables, i.e., 
replace a few elements of the sequence of integers representing the state. Control-fiow 
instructions (e.g., goto) just change the part of the state representing the instruction- 
pointer. 

Assuming we have the Next-State function, it is easy to complete the alignment. To 
do this we need to define the idea that, given states s and s', executing the program in 
state s ^vill eventually lead to a state s'. This just means that it is possible to get from 
s to s' by executing zero or more steps in the program, i.e., we can get from s to s' by 
zero or more applications of Next-State. This is easy to express mathematically as a 
recursive definition: 

Program?) eventually takes s to s' if and only if 



either s=s 



or there is a state s" such that 

s" = Next_State(p ,s) cind program p eventually takes s' to s' 

It is now simple to e:^ress the equation y - apply(p,x): 

y = apply(p,:r) if and only if program p eventually takes :r' to y 

where x' is z with the initial instruction-pointer appended in the proper place. 

This completes the argument that the halting problem can be expressed in a reason- 
ably powerful formal mathematical system. Thus, a reasonably powerful formcl 
mathematical system, if it's consistent, must also be incomplete. 

5. Relevance 

GodeTs Incompleteness Theorem is often misinterpreted. In popular accounts it is 
often represented as a proof of the inherent limitations of logic and mathematics. In 
some quarters it is used as an excuse for irrationality and mysticism. Thus it is impor- 
tant to consider the relevance of Gbdel's theorem, in particular to logic and mathemat- 
ics. 

Gbdel’s theorem has great relevance to computer science. In its form as the Halting 
Problem and its extensions, it tells us that we should not try to find algorithms that can 
infallibly decide certain questions about any given program. Thus, Gbdel’s theorem 
puts ultimate limits on the capabilities of computers. 

Vfiiat is the relevance of Gbdel's theorem to logic and mathematics? It’s major 
effect has been to destroy the formalist program in mathematics. A primary goal of 
that program was to prove, using mathematics, the consistency and completeness of 
mathematics; Gbdel showed that this cannot be done. Gbdel also showed that many 
other formal logical systems, such as three-valued logic, also suffer from the incom- 
pleteness property. This should not surprise us, since any reasonably powerful formal 
system is equivalent to a programming language, and so suffers from the undecidabil- 
ity theorems. 



Does this then indicate an inherent and unavoidable limitation to logic and 
mathematics? Only if you are a formalist. In essence, Gbdel’s theorem says that for’ 
malism ceinnot capture all of the power of mathematics and logic. 

Let's consider an example of the latter case. Consider Fermat's Last Theorem, which 
states that there are no integers x, y, and z such that for some n>2, x^+t/^=z^. This 
theorem has been neither proved nor disproved, although there is ample empirical evi- 
dence of its truth. Suppose that Fermat’s Last Theorem were known to be undecidable 
in some reasonably powerful formal mathematical system. We would then have to con- 
clude that the theorem is true. How can this be? Suppose (contrary to fact) that it 
were false. Then there would exist integers a, 5, c, and k>2 such that 
Now, in all reasonably powerful formal mathematiceJ systems the followng is a veJid 
deduction: 

1. k>2 

2. a^-H6^=c* 

3. Therefore, there exists x, z , and n>2 such that x^' 

The latter proposition is the contradictory of Fermat's Last Theorem. Hence, if 
Fermat's Last Theorem is false, then it mil be decidable in any reasonably powerful for- 
mal mathematical system. Conversely, if Fermat’s Last Theorem is undecidable in any 
reasonably pov/erful formal mathematical system, then the theorem must be true. 
QEa 

Vfhat have we done here? It seems that we’ve tedcen an undecidable theorem and 
decided it. More precisely, we’ve proved a theorem that was not provable mthin the 
system. Sometimes this kind of proof is called ” meta-mat hematic eJ" or “meta- 
Logical," but these are misnomers. The prefi:^ ‘meta-’, suggests that some sort of 
unusual process has been used. In fact we’ve just used the plain, old, garden-v-arieL/, 
Aristotelian logic that’s been known for 2500 years. Cur proof would be clear to Euclid. 
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Thus, there is nothing superior or esoteric about our meta-logical proof. It 7;-ould be 
more accurate to call the deductions of a formal logical system sub-Logica.1 proofs. 

The conclusion we should draw is that formal systems are not very good models for 
either mathematics or logic. Formal systems model situations where propositions can 
be deduced from other propositions without regard for the meanings of the terms in 
those propositions. One of the premises of formalism is that all mathematical truths 
can be derived by applying this formal deductive process to a fixed set of axioms (ini- 
tial strings). This has never been the case in mathematics. On the contrary, although 
there is little debate on the laws of algebra or the calculus, there is an interminable 
debate on the choice of the axioms upon which to found mathematics. 

The problem is that since the time of Euclid mathematics has been shewed as a 
purely dQductive science in which theorems are deduced from given axioms and 
definitions. In Euclid's time these axioms and definitions were considered self-evident; 
the formalists considered them arbitrary. Thus, the axioms and definitions are con- 
sidered the foundation of the edifice of mathematics. 

In this deductive view mathematics is very different from the inductive sciences like 
physics and chemistry. In these the basic laws are neither self-evident nor arbitrary. 
Rather, they are the result of a lengthy process of scientific investigation. They are not 
the foundations of these sciences; they are the capstones. 

In fact mathematics is more like the other sciences than is generally acknowledged. 
For eons man has known that apples drop from trees; it took Nev/ton to e:q)lain this in 
terms of the more basic ideas of mass and gravity. Similarly, for eons man has knovm 
that 2^2=4; mathematicians are still proposing explanations of this fact in terms of 
more basic ideas. Mathematics. like the other sciences, proceeds by a combination of 
induction and deduction. 

In summary, Gbdel’s Incompleteness Theorem is net a theorem about logic or 
mathematics; it is a theorem about programming. It places limits on the capabilities 
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of computers, not on the capabilities of mathematics or logic. Contrary to the com- 
mon notion that it demonstrates the impotence of reason, it is actually a sterling 
example of the power of reason. 
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